Performance Improvement of Automatic Pathological Voice Quality Assessment Based on Higher-Order Statistics
Abstract
This thesis presents new parameters based on higher-order statistics (HOS) analysis to improve the classification performance of a multi-stage pathological voice assessment system. Automatic pathological diagnosis is a field that still demands further investigation, mainly because speech pathologists' diagnoses are difficult to quantify or standardize. In recent years, various speech signal processing techniques have been proposed and applied to voice disorder diagnosis. The objective is to measure quantitatively, by acoustic analysis, how far pathological voice patterns deviate from normal ones. Objective diagnostic support also has the advantage that it can be adopted in everyday practice easily and at low cost. Although most previous studies made novel contributions to the automatic detection of voice disorders and to voice quality assessment, their results are not easy to compare with each other due to the lack of ...
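As a rough illustration of what HOS-based measurement can look like, the Python sketch below computes the normalized skewness and kurtosis of one frame of samples, the two statistics named in the related work listed on this page. The function name `normalized_skewness_kurtosis`, the use of population (biased) moments, and the sine-wave frame are assumptions made for this example; the abstract does not specify the thesis's own parameters, so this should not be read as its method.

```python
import math


def normalized_skewness_kurtosis(frame):
    """Return (skewness, kurtosis) of a sequence of samples.

    Hypothetical helper for illustration only.
    Skewness is the third central moment divided by sigma**3.
    Kurtosis is the fourth central moment divided by sigma**4
    (non-excess form, so a Gaussian gives 3.0).
    Population (1/N) moments are used throughout.
    """
    n = len(frame)
    if n < 2:
        raise ValueError("need at least two samples")
    mean = sum(frame) / n
    m2 = sum((x - mean) ** 2 for x in frame) / n
    if m2 == 0.0:
        raise ValueError("constant frame: skewness and kurtosis are undefined")
    m3 = sum((x - mean) ** 3 for x in frame) / n
    m4 = sum((x - mean) ** 4 for x in frame) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2


# One full period of a sine wave stands in for a voiced frame (assumption).
frame = [math.sin(2 * math.pi * k / 64) for k in range(64)]
skewness, kurtosis = normalized_skewness_kurtosis(frame)
print(skewness, kurtosis)
```

A symmetric waveform such as this sine gives a skewness close to zero, and a distribution with a long tail on one side moves the skewness away from zero. In a frame-based system, a function like this would be applied to successive frames so that per-recording summaries can be derived from the frame values.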
Similar resources
Automatic Assessment of Pathological Voice Quality Using Higher-Order Statistics in the LPC Residual Domain
A preprocessing scheme based on linear prediction coefficient (LPC) residual is applied to higher-order statistics (HOSs) for automatic assessment of an overall pathological voice quality. The normalized skewness and kurtosis are estimated from the LPC residual and show statistically meaningful distributions to characterize the pathological voice quality. 83 voice samples of the sustained vowel...
Objective Pathological Voice Quality Assessment Based on HOS Features
This work proposes new features to improve the pathological voice quality classification performance. They are the means, the variances, and the perturbations of the higher-order statistics (HOS) such as the skewness and the kurtosis. The HOS-based features show meaningful differences among normal, grade 1, grade 2, and grade 3 voices classified in the GRBAS scale. The jitter, the shimmer, the ...
Automatic GRBAS assessment using complexity measures and a multiclass GMM-based detector
This paper presents a system for the automatic assessment of pathological voice quality according to the GRBAS protocol, which uses a short-time scheme and a characterization based on 9 complexity measures, including conventional nonlinear statistics and 7 entropy-based features. The classification is carried out using three different multiclass classification strategies, all of them based on Ga...
Deep Neural Networks for Voice Quality Assessment Based on the GRBAS Scale
In the field of voice therapy, perceptual evaluation is widely used by expert listeners as a way to evaluate pathological and normal voice quality. This approach is understandably subjective, as it is subject to listeners' bias, in which high inter- and intra-listener variability can be found. As such, research on automatic assessment of pathological voices using a combination of subjective and objec...
Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article examines methods for optimizing the performance of GMM-based voice conversion systems, in which the GMM method is introduced as the basic method for improving voice conversion performance. In current methods, because a single conversion function is used to convert all speech units and statistical averaging subsequently causes spectral smoothing, we will observe quality ...